sranjans : Semantic Textual Similarity using Maximal Weighted Bipartite Graph Matching
Abstract
This paper presents a system that measures the degree of semantic equivalence between two sentences. At its core, the approach grades the similarity of two sentences by finding the maximal weighted bipartite matching between their tokens. Tokens are single words, or multiword units in the case of named entities and adjectivally or numerically modified words. Two token similarity measures are used: a WordNet-based similarity and a statistical word similarity measure that overcomes the shortcomings of the WordNet-based one. Across the three systems created for the task, we explore a simple bag-of-words tokenization scheme; a more careful tokenization scheme that captures named entities, times, dates, monetary entities, and so on; and finally an attempt to capture context around tokens using grammatical dependencies.
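To make the matching idea concrete, here is a minimal sketch in Python, not the authors' implementation. It assumes whitespace tokenization (the simplest bag-of-words scheme), uses a hypothetical `bigram_dice` function as a stand-in for the WordNet-based and statistical similarity measures, and normalizes by the length of the longer sentence; all three choices are illustrative assumptions, since the abstract does not specify them. The matching itself is solved with the Hungarian algorithm on negated weights.

```python
def bigram_dice(a, b):
    """Hypothetical token similarity: Dice coefficient over character bigrams.
    A stand-in for the WordNet-based or statistical measures in the abstract."""
    a, b = a.lower(), b.lower()
    if a == b:
        return 1.0
    ba = {a[i:i + 2] for i in range(len(a) - 1)}
    bb = {b[i:i + 2] for i in range(len(b) - 1)}
    if not ba or not bb:
        return 0.0
    return 2 * len(ba & bb) / (len(ba) + len(bb))


def max_weight_matching(weights):
    """Maximum-weight matching on a dense bipartite graph (Hungarian algorithm).
    `weights` is a rows x cols matrix with rows <= cols.
    Returns (total_weight, sorted list of (row, col) pairs)."""
    n, m = len(weights), len(weights[0])
    inf = float("inf")
    u = [0.0] * (n + 1)      # row potentials
    v = [0.0] * (m + 1)      # column potentials
    p = [0] * (m + 1)        # p[j] = row matched to column j (1-based, 0 = free)
    way = [0] * (m + 1)      # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, m + 1):
                if not used[j]:
                    # Negate weights: minimizing cost maximizes total weight.
                    cur = -weights[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        while j0:            # flip the augmenting path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    pairs = sorted((p[j] - 1, j - 1) for j in range(1, m + 1) if p[j])
    return sum(weights[i][j] for i, j in pairs), pairs


def sentence_similarity(s1, s2, sim=bigram_dice):
    """Score two sentences by the best one-to-one token alignment."""
    t1, t2 = s1.split(), s2.split()
    if not t1 or not t2:
        return 0.0
    if len(t1) > len(t2):    # keep the shorter sentence on the row side
        t1, t2 = t2, t1
    weights = [[sim(a, b) for b in t2] for a in t1]
    total, _ = max_weight_matching(weights)
    return total / len(t2)   # assumed normalization: longer sentence length


print(sentence_similarity("a cat sat", "a cat sat"))
```

Swapping `bigram_dice` for a lexical or distributional word similarity only changes how the weight matrix is filled; the matching step stays the same.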
Similar resources
Semantic-based Optimal XML Schema Matching: A Mathematical Programming Approach
We propose a novel solution for semantic-based XML schema matching, taking a mathematical programming approach. This method identifies the globally optimal solution for the problem of matching two XML schemas by reducing the tree-to-tree matching problem to simpler problems of path-to-path, node-to-node, and word-to-word matching. We formulate these matching problems as maximum-weighted bipartit...
janardhan: Semantic Textual Similarity using Universal Networking Language graph matching
Sentences that are syntactically quite different can often have similar or identical meaning. The SemEval 2012 task of Semantic Textual Similarity aims at finding the semantic similarity between two sentences. The semantic representation of Universal Networking Language (UNL) represents only the inherent meaning in a sentence without any syntactic details. Thus, comparing the UNL graphs of two sent...
NTNU-CORE: Combining strong features for semantic similarity
The paper outlines the work carried out at NTNU as part of the *SEM’13 shared task on Semantic Textual Similarity, using an approach which combines shallow textual, distributional and knowledge-based features by a support vector regression model. Feature sets include (1) aggregated similarity based on named entity recognition with WordNet and Levenshtein distance through the calculation of maxi...
An Improved Light-weight Matchmaking Mechanism for Discovering OWL-S Services Based on SPARQL, Bipartite and NLP Approach
Semantic Web services integrate the meaningful content of the Semantic Web with the business logic of Web services and thus enable industries and individuals to access these services. But as the number of available Web services increases, there is a growing demand for a mechanism for effective retrieval of required services. We propose an improved Semantic Web service discovery method for findin...
GMO: A Graph Matching for Ontologies
Ontology matching is an important task to achieve interoperation between semantic web applications using different ontologies. Structural similarity plays a central role in ontology matching. However, the existing approaches rely heavily on lexical similarity, and they mix up lexical similarity with structural similarity. In this paper, we present a graph matching approach for ontologies, calle...